anchor word
- North America > United States > California > San Francisco County > San Francisco (0.14)
- Europe > Italy > Lazio > Rome (0.04)
- North America > Canada > Quebec > Montreal (0.04)
- (2 more...)
- North America > United States > Maryland (0.04)
- Asia > Middle East > Jordan (0.04)
- North America > United States > California (0.04)
- (3 more...)
- North America > United States > Michigan (0.04)
- Asia > Middle East > Jordan (0.04)
- North America > United States > California > Los Angeles County > Long Beach (0.04)
- North America > United States > Minnesota > Hennepin County > Minneapolis (0.28)
- Asia > Middle East > Jordan (0.04)
- North America > United States > Utah (0.04)
- (4 more...)
- North America > United States > Maryland (0.04)
- Asia > Middle East > Jordan (0.04)
- North America > United States > California (0.04)
- (3 more...)
- North America > United States > California > San Francisco County > San Francisco (0.14)
- Europe > Italy > Lazio > Rome (0.04)
- North America > Canada > Quebec > Montreal (0.04)
- (2 more...)
- Asia > Middle East > Jordan (0.04)
- North America > United States > New Jersey > Middlesex County > New Brunswick (0.04)
- North America > United States > New Jersey > Mercer County > Princeton (0.04)
Robust Spectral Inference for Joint Stochastic Matrix Factorization
Moontae Lee, David Bindel, David Mimno
Spectral inference provides fast algorithms and provable optimality for latent topic analysis. But for real data these algorithms require additional ad-hoc heuristics, and even then often produce unusable results. We explain this poor performance by casting the problem of topic inference in the framework of Joint Stochastic Matrix Factorization (JSMF) and showing that previous methods violate the theoretical conditions necessary for a good solution to exist. We then propose a novel rectification method that learns high quality topics and their interactions even on small, noisy data. This method achieves results comparable to probabilistic techniques in several domains while maintaining scalability and provable optimality.
- North America > United States > New York > Tompkins County > Ithaca (0.04)
- Asia > Middle East > Jordan (0.04)
- North America > United States > Nevada (0.04)
- Leisure & Entertainment (0.68)
- Media > Film (0.46)
Prompt Tuning for Few-Shot Continual Learning Named Entity Recognition
Knowledge distillation has been successfully applied to Continual Learning Named Entity Recognition (CLNER) tasks, by using a teacher model trained on old-class data to distill old-class entities present in new-class data as a form of regularization, thereby avoiding catastrophic forgetting. However, in Few-Shot CLNER (FS-CLNER) tasks, the scarcity of new-class entities makes it difficult for the trained model to generalize during inference. More critically, the lack of old-class entity information hinders the distillation of old knowledge, causing the model to fall into what we refer to as the Few-Shot Distillation Dilemma. In this work, we address the above challenges through a prompt tuning paradigm and memory demonstration template strategy. Specifically, we designed an expandable Anchor words-oriented Prompt Tuning (APT) paradigm to bridge the gap between pre-training and fine-tuning, thereby enhancing performance in few-shot scenarios. Additionally, we incorporated Memory Demonstration Templates (MDT) into each training instance to provide replay samples from previous tasks, which not only avoids the Few-Shot Distillation Dilemma but also promotes in-context learning. Experiments show that our approach achieves competitive performances on FS-CLNER.
- Europe > United Kingdom > England (0.04)
- Oceania > Australia (0.04)
- North America > Cuba > Guantánamo Province > Guantánamo (0.04)
- (8 more...)
Testing Hypotheses of Covariate Effects on Topics of Discourse
Phelan, Gabriel, Campbell, David A.
We introduce an approach to topic modelling with document-level covariates that remains tractable in the face of large text corpora. This is achieved by de-emphasizing the role of parameter estimation in an underlying probabilistic model, assuming instead that the data come from a fixed but unknown distribution whose statistical functionals are of interest. We propose combining a convex formulation of non-negative matrix factorization with standard regression techniques as a fast-to-compute and useful estimate of such a functional. Uncertainty quantification can then be achieved by reposing non-parametric resampling methods on top of this scheme. This is in contrast to popular topic modelling paradigms, which posit a complex and often hard-to-fit generative model of the data. We argue that the simple, non-parametric approach advocated here is faster, more interpretable, and enjoys better inferential justification than said generative models. Finally, our methods are demonstrated with an application analysing covariate effects on discourse of flavours attributed to Canadian beers.
- Asia > India (0.04)
- North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
- Asia > Middle East > Jordan (0.04)
- (6 more...)
- Information Technology > Artificial Intelligence > Natural Language (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.69)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.46)
- Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.46)